Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 5008 |
| Missing cells | 1 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 665.3 KiB |
| Average record size in memory | 136.0 B |
Variable types
| DateTime | 1 |
|---|---|
| Text | 14 |
| Categorical | 2 |
crew_aboard is highly overall correlated with crew_fatalities | High correlation |
crew_fatalities is highly overall correlated with crew_aboard | High correlation |
Reproduction
| Analysis started | 2023-12-06 14:51:50.409184 |
|---|---|
| Analysis finished | 2023-12-06 14:52:07.328447 |
| Duration | 16.92 seconds |
| Software version | ydata-profiling vv4.5.1 |
| Download configuration | config.json |
fecha
Date
| Distinct | 4577 |
|---|---|
| Distinct (%) | 91.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
| Minimum | 1908-09-17 00:00:00 |
|---|---|
| Maximum | 2021-07-06 00:00:00 |
HORA declarada
Text
| Distinct | 1217 |
|---|---|
| Distinct (%) | 24.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 3.1565495 |
| Min length | 1 |
Characters and Unicode
| Total characters | 15808 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 585 ? |
|---|---|
| Unique (%) | 11.7% |
Sample
| 1st row | 1718 |
|---|---|
| 2nd row | ? |
| 3rd row | 0630 |
| 4th row | ? |
| 5th row | 1830 |
| Value | Count | Frequency (%) |
| 1504 | ||
| c | 36 | 0.7% |
| 1500 | 35 | 0.7% |
| 1100 | 30 | 0.6% |
| 1400 | 30 | 0.6% |
| 1700 | 29 | 0.6% |
| 1200 | 28 | 0.6% |
| 1600 | 28 | 0.6% |
| 0800 | 26 | 0.5% |
| 1900 | 25 | 0.5% |
| Other values (1189) | 3273 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3748 | |
| 1 | 2976 | |
| 2 | 1545 | |
| ? | 1504 | |
| 5 | 1383 | 8.7% |
| 3 | 1298 | 8.2% |
| 4 | 1004 | 6.4% |
| 9 | 557 | 3.5% |
| 8 | 538 | 3.4% |
| 7 | 524 | 3.3% |
| Other values (6) | 731 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14006 | |
| Other Punctuation | 1723 | 10.9% |
| Lowercase Letter | 38 | 0.2% |
| Space Separator | 36 | 0.2% |
| Uppercase Letter | 5 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3748 | |
| 1 | 2976 | |
| 2 | 1545 | |
| 5 | 1383 | 9.9% |
| 3 | 1298 | 9.3% |
| 4 | 1004 | 7.2% |
| 9 | 557 | 4.0% |
| 8 | 538 | 3.8% |
| 7 | 524 | 3.7% |
| 6 | 433 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 1504 | |
| : | 218 | 12.7% |
| ; | 1 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 38 |
Space Separator
| Value | Count | Frequency (%) |
| 36 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Z | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15765 | |
| Latin | 43 | 0.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3748 | |
| 1 | 2976 | |
| 2 | 1545 | |
| ? | 1504 | |
| 5 | 1383 | 8.8% |
| 3 | 1298 | 8.2% |
| 4 | 1004 | 6.4% |
| 9 | 557 | 3.5% |
| 8 | 538 | 3.4% |
| 7 | 524 | 3.3% |
| Other values (4) | 688 | 4.4% |
Latin
| Value | Count | Frequency (%) |
| c | 38 | |
| Z | 5 | 11.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15808 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3748 | |
| 1 | 2976 | |
| 2 | 1545 | |
| ? | 1504 | |
| 5 | 1383 | 8.7% |
| 3 | 1298 | 8.2% |
| 4 | 1004 | 6.4% |
| 9 | 557 | 3.5% |
| 8 | 538 | 3.4% |
| 7 | 524 | 3.3% |
| Other values (6) | 731 | 4.6% |
Ruta
Text
| Distinct | 4125 |
|---|---|
| Distinct (%) | 82.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 72 |
|---|---|
| Median length | 49 |
| Mean length | 20.792931 |
| Min length | 1 |
Characters and Unicode
| Total characters | 104131 |
|---|---|
| Distinct characters | 91 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3687 ? |
|---|---|
| Unique (%) | 73.6% |
Sample
| 1st row | Fort Myer, Virginia |
|---|---|
| 2nd row | Juvisy-sur-Orge, France |
| 3rd row | Atlantic City, New Jersey |
| 4th row | Victoria, British Columbia, Canada |
| 5th row | Over the North Sea |
| Value | Count | Frequency (%) |
| near | 1350 | 9.2% |
| off | 350 | 2.4% |
| russia | 255 | 1.7% |
| new | 229 | 1.6% |
| brazil | 176 | 1.2% |
| colombia | 153 | 1.0% |
| canada | 131 | 0.9% |
| france | 127 | 0.9% |
| california | 117 | 0.8% |
| mexico | 113 | 0.8% |
| Other values (4153) | 11657 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 13037 | 12.5% |
| 9703 | 9.3% | |
| e | 7073 | 6.8% |
| i | 6567 | 6.3% |
| n | 6545 | 6.3% |
| r | 6035 | 5.8% |
| o | 5367 | 5.2% |
| , | 5210 | 5.0% |
| l | 4000 | 3.8% |
| s | 3530 | 3.4% |
| Other values (81) | 37064 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 74113 | |
| Uppercase Letter | 14738 | 14.2% |
| Space Separator | 9704 | 9.3% |
| Other Punctuation | 5362 | 5.1% |
| Dash Punctuation | 105 | 0.1% |
| Decimal Number | 66 | 0.1% |
| Control | 21 | < 0.1% |
| Close Punctuation | 11 | < 0.1% |
| Open Punctuation | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 13037 | |
| e | 7073 | |
| i | 6567 | |
| n | 6545 | |
| r | 6035 | 8.1% |
| o | 5367 | 7.2% |
| l | 4000 | 5.4% |
| s | 3530 | 4.8% |
| t | 3112 | 4.2% |
| u | 2756 | 3.7% |
| Other values (31) | 16091 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2032 | |
| C | 1456 | 9.9% |
| S | 1145 | 7.8% |
| M | 999 | 6.8% |
| B | 952 | 6.5% |
| A | 920 | 6.2% |
| P | 787 | 5.3% |
| I | 720 | 4.9% |
| R | 652 | 4.4% |
| O | 588 | 4.0% |
| Other values (17) | 4487 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 24 | |
| 1 | 15 | |
| 2 | 9 | 13.6% |
| 5 | 8 | 12.1% |
| 8 | 3 | 4.5% |
| 3 | 2 | 3.0% |
| 7 | 2 | 3.0% |
| 9 | 2 | 3.0% |
| 6 | 1 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5210 | |
| . | 115 | 2.1% |
| ' | 24 | 0.4% |
| / | 6 | 0.1% |
| ? | 5 | 0.1% |
| & | 1 | < 0.1% |
| : | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 9703 | ||
| Â | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 16 | ||
| 5 | 23.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 105 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 11 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 88851 | |
| Common | 15280 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 13037 | |
| e | 7073 | 8.0% |
| i | 6567 | 7.4% |
| n | 6545 | 7.4% |
| r | 6035 | 6.8% |
| o | 5367 | 6.0% |
| l | 4000 | 4.5% |
| s | 3530 | 4.0% |
| t | 3112 | 3.5% |
| u | 2756 | 3.1% |
| Other values (58) | 30829 |
Common
| Value | Count | Frequency (%) |
| 9703 | ||
| , | 5210 | |
| . | 115 | 0.8% |
| - | 105 | 0.7% |
| 0 | 24 | 0.2% |
| ' | 24 | 0.2% |
| 16 | 0.1% | |
| 1 | 15 | 0.1% |
| ) | 11 | 0.1% |
| ( | 11 | 0.1% |
| Other values (13) | 46 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104089 | |
| None | 42 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 13037 | 12.5% |
| 9703 | 9.3% | |
| e | 7073 | 6.8% |
| i | 6567 | 6.3% |
| n | 6545 | 6.3% |
| r | 6035 | 5.8% |
| o | 5367 | 5.2% |
| , | 5210 | 5.0% |
| l | 4000 | 3.8% |
| s | 3530 | 3.4% |
| Other values (64) | 37022 |
None
| Value | Count | Frequency (%) |
| é | 14 | |
| ö | 5 | 11.9% |
| Ã | 4 | 9.5% |
| ó | 4 | 9.5% |
| ï | 2 | 4.8% |
| á | 2 | 4.8% |
| Ã | 1 | 2.4% |
| ô | 1 | 2.4% |
| è | 1 | 2.4% |
| ä | 1 | 2.4% |
| Other values (7) | 7 |
OperadOR
Text
| Distinct | 2268 |
|---|---|
| Distinct (%) | 45.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 65 |
|---|---|
| Median length | 47 |
| Mean length | 18.921725 |
| Min length | 1 |
Characters and Unicode
| Total characters | 94760 |
|---|---|
| Distinct characters | 87 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 1734 ? |
|---|---|
| Unique (%) | 34.6% |
Sample
| 1st row | Military - U.S. Army |
|---|---|
| 2nd row | ? |
| 3rd row | Military - U.S. Navy |
| 4th row | Private |
| 5th row | Military - German Navy |
| Value | Count | Frequency (%) |
| air | 1481 | 10.3% |
| 971 | 6.7% | |
| airlines | 840 | 5.8% |
| military | 778 | 5.4% |
| force | 557 | 3.9% |
| airways | 453 | 3.1% |
| u.s | 302 | 2.1% |
| aeroflot | 265 | 1.8% |
| lines | 184 | 1.3% |
| royal | 152 | 1.1% |
| Other values (2079) | 8422 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 10212 | 10.8% |
| 9421 | 9.9% | |
| r | 8849 | 9.3% |
| a | 7786 | 8.2% |
| e | 6780 | 7.2% |
| n | 5528 | 5.8% |
| A | 5083 | 5.4% |
| o | 4380 | 4.6% |
| l | 4079 | 4.3% |
| s | 4000 | 4.2% |
| Other values (77) | 28642 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 68181 | |
| Uppercase Letter | 15071 | 15.9% |
| Space Separator | 9422 | 9.9% |
| Dash Punctuation | 939 | 1.0% |
| Other Punctuation | 879 | 0.9% |
| Open Punctuation | 115 | 0.1% |
| Close Punctuation | 115 | 0.1% |
| Decimal Number | 30 | < 0.1% |
| Control | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 10212 | |
| r | 8849 | |
| a | 7786 | |
| e | 6780 | |
| n | 5528 | |
| o | 4380 | |
| l | 4079 | 6.0% |
| s | 4000 | 5.9% |
| t | 3921 | 5.8% |
| c | 1996 | 2.9% |
| Other values (28) | 10650 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 5083 | |
| M | 1217 | 8.1% |
| S | 1138 | 7.6% |
| C | 910 | 6.0% |
| F | 901 | 6.0% |
| T | 679 | 4.5% |
| L | 661 | 4.4% |
| U | 534 | 3.5% |
| P | 513 | 3.4% |
| N | 496 | 3.3% |
| Other values (16) | 2939 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 7 | 4 | |
| 4 | 4 | |
| 1 | 3 | |
| 2 | 3 | |
| 5 | 3 | |
| 6 | 2 | 6.7% |
| 8 | 2 | 6.7% |
| 9 | 2 | 6.7% |
| 3 | 2 | 6.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 718 | |
| / | 109 | 12.4% |
| ' | 25 | 2.8% |
| ? | 11 | 1.3% |
| , | 10 | 1.1% |
| & | 6 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 9421 | ||
| Â | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 6 | ||
| 2 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 939 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 115 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 115 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 83252 | |
| Common | 11508 | 12.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 10212 | |
| r | 8849 | 10.6% |
| a | 7786 | 9.4% |
| e | 6780 | 8.1% |
| n | 5528 | 6.6% |
| A | 5083 | 6.1% |
| o | 4380 | 5.3% |
| l | 4079 | 4.9% |
| s | 4000 | 4.8% |
| t | 3921 | 4.7% |
| Other values (54) | 22634 |
Common
| Value | Count | Frequency (%) |
| 9421 | ||
| - | 939 | 8.2% |
| . | 718 | 6.2% |
| ( | 115 | 1.0% |
| ) | 115 | 1.0% |
| / | 109 | 0.9% |
| ' | 25 | 0.2% |
| ? | 11 | 0.1% |
| , | 10 | 0.1% |
| & | 6 | 0.1% |
| Other values (13) | 39 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 94637 | |
| None | 123 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 10212 | 10.8% |
| 9421 | 10.0% | |
| r | 8849 | 9.4% |
| a | 7786 | 8.2% |
| e | 6780 | 7.2% |
| n | 5528 | 5.8% |
| A | 5083 | 5.4% |
| o | 4380 | 4.6% |
| l | 4079 | 4.3% |
| s | 4000 | 4.2% |
| Other values (64) | 28519 |
None
| Value | Count | Frequency (%) |
| é | 102 | |
| á | 5 | 4.1% |
| Ã | 2 | 1.6% |
| ï | 2 | 1.6% |
| ó | 2 | 1.6% |
| Ã | 2 | 1.6% |
| ç | 2 | 1.6% |
| ã | 1 | 0.8% |
| ú | 1 | 0.8% |
| ê | 1 | 0.8% |
| Other values (3) | 3 | 2.4% |
flight_no
Text
| Distinct | 893 |
|---|---|
| Distinct (%) | 17.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 1 |
| Mean length | 1.5928514 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7977 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 656 ? |
|---|---|
| Unique (%) | 13.1% |
Sample
| 1st row | ? |
|---|---|
| 2nd row | ? |
| 3rd row | ? |
| 4th row | ? |
| 5th row | ? |
| Value | Count | Frequency (%) |
| 3728 | ||
| 1 | 11 | 0.2% |
| 101 | 10 | 0.2% |
| 6 | 8 | 0.2% |
| 4 | 7 | 0.1% |
| 901 | 7 | 0.1% |
| 115 | 6 | 0.1% |
| 301 | 6 | 0.1% |
| 201 | 6 | 0.1% |
| 703 | 6 | 0.1% |
| Other values (883) | 1235 | 24.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| ? | 3683 | |
| 1 | 638 | 8.0% |
| 0 | 497 | 6.2% |
| 2 | 495 | 6.2% |
| 3 | 417 | 5.2% |
| 5 | 385 | 4.8% |
| 4 | 347 | 4.4% |
| 6 | 330 | 4.1% |
| 7 | 316 | 4.0% |
| 8 | 291 | 3.6% |
| Other values (37) | 578 | 7.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3986 | |
| Other Punctuation | 3715 | |
| Uppercase Letter | 156 | 2.0% |
| Dash Punctuation | 87 | 1.1% |
| Space Separator | 22 | 0.3% |
| Lowercase Letter | 11 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 21 | |
| S | 14 | 9.0% |
| H | 13 | 8.3% |
| P | 11 | 7.1% |
| F | 10 | 6.4% |
| C | 10 | 6.4% |
| U | 8 | 5.1% |
| R | 7 | 4.5% |
| I | 7 | 4.5% |
| L | 7 | 4.5% |
| Other values (15) | 48 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 638 | |
| 0 | 497 | |
| 2 | 495 | |
| 3 | 417 | |
| 5 | 385 | |
| 4 | 347 | |
| 6 | 330 | |
| 7 | 316 | |
| 8 | 291 | |
| 9 | 270 |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2 | |
| a | 2 | |
| r | 2 | |
| o | 1 | |
| y | 1 | |
| h | 1 | |
| t | 1 | |
| e | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 3683 | |
| / | 32 | 0.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 87 |
Space Separator
| Value | Count | Frequency (%) |
| 22 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7810 | |
| Latin | 167 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 21 | 12.6% |
| S | 14 | 8.4% |
| H | 13 | 7.8% |
| P | 11 | 6.6% |
| F | 10 | 6.0% |
| C | 10 | 6.0% |
| U | 8 | 4.8% |
| R | 7 | 4.2% |
| I | 7 | 4.2% |
| L | 7 | 4.2% |
| Other values (23) | 59 |
Common
| Value | Count | Frequency (%) |
| ? | 3683 | |
| 1 | 638 | 8.2% |
| 0 | 497 | 6.4% |
| 2 | 495 | 6.3% |
| 3 | 417 | 5.3% |
| 5 | 385 | 4.9% |
| 4 | 347 | 4.4% |
| 6 | 330 | 4.2% |
| 7 | 316 | 4.0% |
| 8 | 291 | 3.7% |
| Other values (4) | 411 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7977 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ? | 3683 | |
| 1 | 638 | 8.0% |
| 0 | 497 | 6.2% |
| 2 | 495 | 6.2% |
| 3 | 417 | 5.2% |
| 5 | 385 | 4.8% |
| 4 | 347 | 4.4% |
| 6 | 330 | 4.1% |
| 7 | 316 | 4.0% |
| 8 | 291 | 3.6% |
| Other values (37) | 578 | 7.2% |
route
Text
| Distinct | 3838 |
|---|---|
| Distinct (%) | 76.7% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 39.3 KiB |
Length
| Max length | 59 |
|---|---|
| Median length | 52 |
| Mean length | 18.948472 |
| Min length | 1 |
Characters and Unicode
| Total characters | 94875 |
|---|---|
| Distinct characters | 92 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 3630 ? |
|---|---|
| Unique (%) | 72.5% |
Sample
| 1st row | Demonstration |
|---|---|
| 2nd row | Air show |
| 3rd row | Test flight |
| 4th row | ? |
| 5th row | ? |
| Value | Count | Frequency (%) |
| 5395 | ||
| city | 213 | 1.2% |
| new | 149 | 0.8% |
| san | 140 | 0.8% |
| york | 117 | 0.7% |
| paris | 116 | 0.7% |
| training | 103 | 0.6% |
| de | 101 | 0.6% |
| london | 88 | 0.5% |
| moscow | 84 | 0.5% |
| Other values (3627) | 11082 |
Most occurring characters
| Value | Count | Frequency (%) |
| 12646 | 13.3% | |
| a | 9833 | 10.4% |
| n | 5568 | 5.9% |
| o | 5503 | 5.8% |
| i | 5244 | 5.5% |
| e | 5107 | 5.4% |
| - | 4927 | 5.2% |
| r | 4487 | 4.7% |
| l | 3420 | 3.6% |
| s | 3075 | 3.2% |
| Other values (82) | 35065 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 62474 | |
| Uppercase Letter | 12941 | 13.6% |
| Space Separator | 12647 | 13.3% |
| Dash Punctuation | 4931 | 5.2% |
| Other Punctuation | 1827 | 1.9% |
| Control | 30 | < 0.1% |
| Decimal Number | 16 | < 0.1% |
| Final Punctuation | 4 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 9833 | |
| n | 5568 | 8.9% |
| o | 5503 | 8.8% |
| i | 5244 | 8.4% |
| e | 5107 | 8.2% |
| r | 4487 | 7.2% |
| l | 3420 | 5.5% |
| s | 3075 | 4.9% |
| t | 3006 | 4.8% |
| u | 2566 | 4.1% |
| Other values (30) | 14665 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1232 | 9.5% |
| B | 1140 | 8.8% |
| S | 1081 | 8.4% |
| A | 1046 | 8.1% |
| M | 1042 | 8.1% |
| P | 823 | 6.4% |
| L | 788 | 6.1% |
| T | 710 | 5.5% |
| K | 640 | 4.9% |
| N | 630 | 4.9% |
| Other values (18) | 3809 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 1 | 3 | |
| 4 | 3 | |
| 2 | 2 | |
| 7 | 2 | |
| 8 | 1 | 6.2% |
| 6 | 1 | 6.2% |
| 0 | 1 | 6.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 915 | |
| ? | 768 | |
| . | 98 | 5.4% |
| / | 20 | 1.1% |
| ' | 20 | 1.1% |
| : | 5 | 0.3% |
| \ | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 12646 | ||
| Â | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4927 | |
| – | 4 | 0.1% |
Control
| Value | Count | Frequency (%) |
| 29 | ||
| 1 | 3.3% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 75415 | |
| Common | 19460 | 20.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 9833 | 13.0% |
| n | 5568 | 7.4% |
| o | 5503 | 7.3% |
| i | 5244 | 7.0% |
| e | 5107 | 6.8% |
| r | 4487 | 5.9% |
| l | 3420 | 4.5% |
| s | 3075 | 4.1% |
| t | 3006 | 4.0% |
| u | 2566 | 3.4% |
| Other values (58) | 27606 |
Common
| Value | Count | Frequency (%) |
| 12646 | ||
| - | 4927 | 25.3% |
| , | 915 | 4.7% |
| ? | 768 | 3.9% |
| . | 98 | 0.5% |
| 29 | 0.1% | |
| / | 20 | 0.1% |
| ' | 20 | 0.1% |
| : | 5 | < 0.1% |
| – | 4 | < 0.1% |
| Other values (14) | 28 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 94746 | |
| None | 121 | 0.1% |
| Punctuation | 8 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 12646 | 13.3% | |
| a | 9833 | 10.4% |
| n | 5568 | 5.9% |
| o | 5503 | 5.8% |
| i | 5244 | 5.5% |
| e | 5107 | 5.4% |
| - | 4927 | 5.2% |
| r | 4487 | 4.7% |
| l | 3420 | 3.6% |
| s | 3075 | 3.2% |
| Other values (63) | 34936 |
None
| Value | Count | Frequency (%) |
| é | 38 | |
| Ã | 21 | |
| á | 15 | 12.4% |
| ó | 14 | 11.6% |
| ü | 6 | 5.0% |
| ã | 6 | 5.0% |
| ç | 4 | 3.3% |
| è | 4 | 3.3% |
| ÃŽ | 3 | 2.5% |
| ö | 2 | 1.7% |
| Other values (7) | 8 | 6.6% |
Punctuation
| Value | Count | Frequency (%) |
| – | 4 | |
| ’ | 4 |
ac_type
Text
| Distinct | 2469 |
|---|---|
| Distinct (%) | 49.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 42 |
|---|---|
| Median length | 36 |
| Mean length | 18.496006 |
| Min length | 1 |
Characters and Unicode
| Total characters | 92628 |
|---|---|
| Distinct characters | 78 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 1863 ? |
|---|---|
| Unique (%) | 37.2% |
Sample
| 1st row | Wright Flyer III |
|---|---|
| 2nd row | Wright Byplane |
| 3rd row | Dirigible |
| 4th row | Curtiss seaplane |
| 5th row | Zeppelin L-1 (airship) |
| Value | Count | Frequency (%) |
| douglas | 1130 | 8.3% |
| boeing | 418 | 3.1% |
| dc-3 | 387 | 2.8% |
| lockheed | 332 | 2.4% |
| de | 294 | 2.2% |
| havilland | 292 | 2.1% |
| antonov | 288 | 2.1% |
| canada | 159 | 1.2% |
| otter | 146 | 1.1% |
| ilyushin | 142 | 1.0% |
| Other values (2525) | 10038 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8649 | 9.3% | |
| - | 5180 | 5.6% |
| e | 4842 | 5.2% |
| o | 4638 | 5.0% |
| a | 4636 | 5.0% |
| n | 3856 | 4.2% |
| l | 3696 | 4.0% |
| i | 3486 | 3.8% |
| r | 3306 | 3.6% |
| C | 3034 | 3.3% |
| Other values (68) | 47305 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 46427 | |
| Uppercase Letter | 17900 | 19.3% |
| Decimal Number | 13808 | 14.9% |
| Space Separator | 8650 | 9.3% |
| Dash Punctuation | 5180 | 5.6% |
| Other Punctuation | 277 | 0.3% |
| Open Punctuation | 190 | 0.2% |
| Close Punctuation | 189 | 0.2% |
| Math Symbol | 3 | < 0.1% |
| Control | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4842 | |
| o | 4638 | |
| a | 4636 | |
| n | 3856 | 8.3% |
| l | 3696 | 8.0% |
| i | 3486 | 7.5% |
| r | 3306 | 7.1% |
| s | 2917 | 6.3% |
| t | 2357 | 5.1% |
| u | 2217 | 4.8% |
| Other values (18) | 10476 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3034 | |
| D | 2819 | |
| A | 1901 | |
| B | 1728 | |
| H | 1016 | 5.7% |
| L | 883 | 4.9% |
| F | 796 | 4.4% |
| S | 790 | 4.4% |
| I | 642 | 3.6% |
| T | 620 | 3.5% |
| Other values (16) | 3671 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2167 | |
| 0 | 2103 | |
| 1 | 2017 | |
| 3 | 1706 | |
| 4 | 1704 | |
| 7 | 1494 | |
| 6 | 875 | |
| 5 | 713 | 5.2% |
| 8 | 664 | 4.8% |
| 9 | 365 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 185 | |
| . | 76 | |
| ? | 13 | 4.7% |
| , | 2 | 0.7% |
| & | 1 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 8649 | ||
| Â | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5180 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 190 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 189 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 |
Control
| Value | Count | Frequency (%) |
| 2 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 64327 | |
| Common | 28301 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4842 | 7.5% |
| o | 4638 | 7.2% |
| a | 4636 | 7.2% |
| n | 3856 | 6.0% |
| l | 3696 | 5.7% |
| i | 3486 | 5.4% |
| r | 3306 | 5.1% |
| C | 3034 | 4.7% |
| s | 2917 | 4.5% |
| D | 2819 | 4.4% |
| Other values (44) | 27097 |
Common
| Value | Count | Frequency (%) |
| 8649 | ||
| - | 5180 | |
| 2 | 2167 | 7.7% |
| 0 | 2103 | 7.4% |
| 1 | 2017 | 7.1% |
| 3 | 1706 | 6.0% |
| 4 | 1704 | 6.0% |
| 7 | 1494 | 5.3% |
| 6 | 875 | 3.1% |
| 5 | 713 | 2.5% |
| Other values (14) | 1693 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 92609 | |
| None | 17 | < 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8649 | 9.3% | |
| - | 5180 | 5.6% |
| e | 4842 | 5.2% |
| o | 4638 | 5.0% |
| a | 4636 | 5.0% |
| n | 3856 | 4.2% |
| l | 3696 | 4.0% |
| i | 3486 | 3.8% |
| r | 3306 | 3.6% |
| C | 3034 | 3.3% |
| Other values (63) | 47286 |
None
| Value | Count | Frequency (%) |
| é | 12 | |
| è | 4 | 23.5% |
| Â | 1 | 5.9% |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 | |
| ’ | 1 |
registration
Text
| Distinct | 4701 |
|---|---|
| Distinct (%) | 93.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 6 |
| Mean length | 6.1956869 |
| Min length | 1 |
Characters and Unicode
| Total characters | 31028 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4665 ? |
|---|---|
| Unique (%) | 93.2% |
Sample
| 1st row | ? |
|---|---|
| 2nd row | SC1 |
| 3rd row | ? |
| 4th row | ? |
| 5th row | ? |
| Value | Count | Frequency (%) |
| 311 | 6.1% | |
| hk | 4 | 0.1% |
| 49 | 3 | 0.1% |
| f-aeej | 2 | < 0.1% |
| 32 | 2 | < 0.1% |
| 82 | 2 | < 0.1% |
| 53 | 2 | < 0.1% |
| cf-tcl | 2 | < 0.1% |
| 12406 | 2 | < 0.1% |
| f-bbdm | 2 | < 0.1% |
| Other values (4732) | 4772 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 3497 | 11.3% |
| C | 2022 | 6.5% |
| A | 1711 | 5.5% |
| 1 | 1541 | 5.0% |
| N | 1432 | 4.6% |
| 2 | 1246 | 4.0% |
| P | 1193 | 3.8% |
| 4 | 1187 | 3.8% |
| 5 | 1132 | 3.6% |
| 0 | 1098 | 3.5% |
| Other values (39) | 14969 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 15946 | |
| Decimal Number | 11081 | |
| Dash Punctuation | 3497 | 11.3% |
| Other Punctuation | 391 | 1.3% |
| Space Separator | 90 | 0.3% |
| Control | 12 | < 0.1% |
| Lowercase Letter | 10 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2022 | 12.7% |
| A | 1711 | 10.7% |
| N | 1432 | 9.0% |
| P | 1193 | 7.5% |
| B | 718 | 4.5% |
| F | 690 | 4.3% |
| H | 636 | 4.0% |
| T | 611 | 3.8% |
| E | 560 | 3.5% |
| G | 559 | 3.5% |
| Other values (16) | 5814 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1541 | |
| 2 | 1246 | |
| 4 | 1187 | |
| 5 | 1132 | |
| 0 | 1098 | |
| 3 | 1037 | |
| 6 | 1026 | |
| 7 | 1015 | |
| 8 | 912 | |
| 9 | 887 |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 5 | |
| y | 1 | 10.0% |
| e | 1 | 10.0% |
| o | 1 | 10.0% |
| w | 1 | 10.0% |
| d | 1 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 277 | |
| / | 114 |
Control
| Value | Count | Frequency (%) |
| 10 | ||
| 2 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3497 |
Space Separator
| Value | Count | Frequency (%) |
| 90 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15956 | |
| Common | 15072 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 2022 | 12.7% |
| A | 1711 | 10.7% |
| N | 1432 | 9.0% |
| P | 1193 | 7.5% |
| B | 718 | 4.5% |
| F | 690 | 4.3% |
| H | 636 | 4.0% |
| T | 611 | 3.8% |
| E | 560 | 3.5% |
| G | 559 | 3.5% |
| Other values (22) | 5824 |
Common
| Value | Count | Frequency (%) |
| - | 3497 | |
| 1 | 1541 | |
| 2 | 1246 | 8.3% |
| 4 | 1187 | 7.9% |
| 5 | 1132 | 7.5% |
| 0 | 1098 | 7.3% |
| 3 | 1037 | 6.9% |
| 6 | 1026 | 6.8% |
| 7 | 1015 | 6.7% |
| 8 | 912 | 6.1% |
| Other values (7) | 1381 | 9.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31028 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 3497 | 11.3% |
| C | 2022 | 6.5% |
| A | 1711 | 5.5% |
| 1 | 1541 | 5.0% |
| N | 1432 | 4.6% |
| 2 | 1246 | 4.0% |
| P | 1193 | 3.8% |
| 4 | 1187 | 3.8% |
| 5 | 1132 | 3.6% |
| 0 | 1098 | 3.5% |
| Other values (39) | 14969 |
cn_ln
Text
| Distinct | 3908 |
|---|---|
| Distinct (%) | 78.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 4.9239217 |
| Min length | 1 |
Characters and Unicode
| Total characters | 24659 |
|---|---|
| Distinct characters | 44 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3608 ? |
|---|---|
| Unique (%) | 72.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | ? |
| 3rd row | ? |
| 4th row | ? |
| 5th row | ? |
| Value | Count | Frequency (%) |
| 724 | 14.1% | |
| 1 | 10 | 0.2% |
| 4 | 9 | 0.2% |
| 125 | 7 | 0.1% |
| 3 | 7 | 0.1% |
| 30 | 7 | 0.1% |
| 229 | 6 | 0.1% |
| 2 | 5 | 0.1% |
| 18 | 5 | 0.1% |
| 213 | 5 | 0.1% |
| Other values (3928) | 4359 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3485 | |
| 0 | 3141 | |
| 2 | 2641 | |
| 4 | 2366 | |
| 3 | 2343 | |
| 5 | 1861 | |
| 6 | 1593 | |
| 9 | 1582 | |
| 7 | 1577 | |
| 8 | 1537 | |
| Other values (34) | 2533 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 22126 | |
| Other Punctuation | 1380 | 5.6% |
| Uppercase Letter | 582 | 2.4% |
| Dash Punctuation | 430 | 1.7% |
| Space Separator | 136 | 0.6% |
| Control | 3 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 125 | |
| B | 65 | |
| C | 62 | |
| S | 55 | |
| T | 45 | 7.7% |
| H | 32 | 5.5% |
| U | 26 | 4.5% |
| G | 20 | 3.4% |
| N | 20 | 3.4% |
| E | 17 | 2.9% |
| Other values (14) | 115 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3485 | |
| 0 | 3141 | |
| 2 | 2641 | |
| 4 | 2366 | |
| 3 | 2343 | |
| 5 | 1861 | |
| 6 | 1593 | |
| 9 | 1582 | |
| 7 | 1577 | |
| 8 | 1537 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 699 | |
| ? | 679 | |
| : | 1 | 0.1% |
| . | 1 | 0.1% |
Control
| Value | Count | Frequency (%) |
| 2 | ||
| 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 430 |
Space Separator
| Value | Count | Frequency (%) |
| 136 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 24077 | |
| Latin | 582 | 2.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 125 | |
| B | 65 | |
| C | 62 | |
| S | 55 | |
| T | 45 | 7.7% |
| H | 32 | 5.5% |
| U | 26 | 4.5% |
| G | 20 | 3.4% |
| N | 20 | 3.4% |
| E | 17 | 2.9% |
| Other values (14) | 115 |
Common
| Value | Count | Frequency (%) |
| 1 | 3485 | |
| 0 | 3141 | |
| 2 | 2641 | |
| 4 | 2366 | |
| 3 | 2343 | |
| 5 | 1861 | |
| 6 | 1593 | |
| 9 | 1582 | |
| 7 | 1577 | |
| 8 | 1537 | |
| Other values (10) | 1951 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24659 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3485 | |
| 0 | 3141 | |
| 2 | 2641 | |
| 4 | 2366 | |
| 3 | 2343 | |
| 5 | 1861 | |
| 6 | 1593 | |
| 9 | 1582 | |
| 7 | 1577 | |
| 8 | 1537 | |
| Other values (34) | 2533 |
all_aboard
Text
| Distinct | 245 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 1.7360224 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8694 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 68 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 5 |
| 4th row | 1 |
| 5th row | 20 |
| Value | Count | Frequency (%) |
| 3 | 280 | 5.6% |
| 2 | 246 | 4.9% |
| 4 | 202 | 4.0% |
| 5 | 190 | 3.8% |
| 10 | 179 | 3.6% |
| 6 | 174 | 3.5% |
| 7 | 164 | 3.3% |
| 1 | 139 | 2.8% |
| 9 | 130 | 2.6% |
| 11 | 128 | 2.6% |
| Other values (235) | 3176 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2009 | |
| 2 | 1411 | |
| 3 | 1042 | |
| 4 | 832 | |
| 5 | 694 | 8.0% |
| 6 | 607 | 7.0% |
| 7 | 570 | 6.6% |
| 0 | 540 | 6.2% |
| 8 | 504 | 5.8% |
| 9 | 468 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8677 | |
| Other Punctuation | 17 | 0.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2009 | |
| 2 | 1411 | |
| 3 | 1042 | |
| 4 | 832 | |
| 5 | 694 | 8.0% |
| 6 | 607 | 7.0% |
| 7 | 570 | 6.6% |
| 0 | 540 | 6.2% |
| 8 | 504 | 5.8% |
| 9 | 468 | 5.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2009 | |
| 2 | 1411 | |
| 3 | 1042 | |
| 4 | 832 | |
| 5 | 694 | 8.0% |
| 6 | 607 | 7.0% |
| 7 | 570 | 6.6% |
| 0 | 540 | 6.2% |
| 8 | 504 | 5.8% |
| 9 | 468 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2009 | |
| 2 | 1411 | |
| 3 | 1042 | |
| 4 | 832 | |
| 5 | 694 | 8.0% |
| 6 | 607 | 7.0% |
| 7 | 570 | 6.6% |
| 0 | 540 | 6.2% |
| 8 | 504 | 5.8% |
| 9 | 468 | 5.4% |
| Distinct | 235 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 1.5988419 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8007 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 68 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | ? |
| Value | Count | Frequency (%) |
| 0 | 869 | 17.4% |
| 221 | 4.4% | |
| 4 | 170 | 3.4% |
| 2 | 162 | 3.2% |
| 5 | 140 | 2.8% |
| 7 | 130 | 2.6% |
| 3 | 130 | 2.6% |
| 10 | 128 | 2.6% |
| 9 | 128 | 2.6% |
| 8 | 126 | 2.5% |
| Other values (225) | 2804 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1687 | |
| 0 | 1282 | |
| 2 | 1026 | |
| 3 | 730 | |
| 4 | 692 | |
| 5 | 599 | 7.5% |
| 7 | 473 | 5.9% |
| 6 | 465 | 5.8% |
| 9 | 416 | 5.2% |
| 8 | 416 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7786 | |
| Other Punctuation | 221 | 2.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1687 | |
| 0 | 1282 | |
| 2 | 1026 | |
| 3 | 730 | |
| 4 | 692 | |
| 5 | 599 | 7.7% |
| 7 | 473 | 6.1% |
| 6 | 465 | 6.0% |
| 9 | 416 | 5.3% |
| 8 | 416 | 5.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 221 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8007 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1687 | |
| 0 | 1282 | |
| 2 | 1026 | |
| 3 | 730 | |
| 4 | 692 | |
| 5 | 599 | 7.5% |
| 7 | 473 | 5.9% |
| 6 | 465 | 5.8% |
| 9 | 416 | 5.2% |
| 8 | 416 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8007 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1687 | |
| 0 | 1282 | |
| 2 | 1026 | |
| 3 | 730 | |
| 4 | 692 | |
| 5 | 599 | 7.5% |
| 7 | 473 | 5.9% |
| 6 | 465 | 5.8% |
| 9 | 416 | 5.2% |
| 8 | 416 | 5.2% |
crew_aboard
Categorical
HIGH CORRELATION 
| Distinct | 35 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
| 3 | |
|---|---|
| 2 | |
| 4 | |
| 1 | |
| 5 | |
| Other values (30) |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0698882 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5358 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 5 |
| 4th row | 1 |
| 5th row | ? |
Common Values
| Value | Count | Frequency (%) |
| 3 | 954 | |
| 2 | 828 | |
| 4 | 694 | |
| 1 | 535 | |
| 5 | 514 | |
| 6 | 375 | 7.5% |
| 7 | 244 | 4.9% |
| ? | 219 | 4.4% |
| 8 | 173 | 3.5% |
| 9 | 115 | 2.3% |
| Other values (25) | 357 | 7.1% |
Length
| Value | Count | Frequency (%) |
| 3 | 954 | |
| 2 | 828 | |
| 4 | 694 | |
| 1 | 535 | |
| 5 | 514 | |
| 6 | 375 | 7.5% |
| 7 | 244 | 4.9% |
| 219 | 4.4% | |
| 8 | 173 | 3.5% |
| 9 | 115 | 2.3% |
| Other values (25) | 357 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 993 | |
| 1 | 920 | |
| 2 | 902 | |
| 4 | 727 | |
| 5 | 538 | |
| 6 | 389 | 7.3% |
| 7 | 252 | 4.7% |
| ? | 219 | 4.1% |
| 8 | 181 | 3.4% |
| 9 | 126 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5139 | |
| Other Punctuation | 219 | 4.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 993 | |
| 1 | 920 | |
| 2 | 902 | |
| 4 | 727 | |
| 5 | 538 | |
| 6 | 389 | 7.6% |
| 7 | 252 | 4.9% |
| 8 | 181 | 3.5% |
| 9 | 126 | 2.5% |
| 0 | 111 | 2.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 219 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5358 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 993 | |
| 1 | 920 | |
| 2 | 902 | |
| 4 | 727 | |
| 5 | 538 | |
| 6 | 389 | 7.3% |
| 7 | 252 | 4.7% |
| ? | 219 | 4.1% |
| 8 | 181 | 3.4% |
| 9 | 126 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5358 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 993 | |
| 1 | 920 | |
| 2 | 902 | |
| 4 | 727 | |
| 5 | 538 | |
| 6 | 389 | 7.3% |
| 7 | 252 | 4.7% |
| ? | 219 | 4.1% |
| 8 | 181 | 3.4% |
| 9 | 126 | 2.4% |
| Distinct | 200 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 1.5842652 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7934 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 51 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 5 |
| 4th row | 1 |
| 5th row | 14 |
| Value | Count | Frequency (%) |
| 1 | 384 | 7.7% |
| 2 | 377 | 7.5% |
| 3 | 363 | 7.2% |
| 4 | 242 | 4.8% |
| 5 | 235 | 4.7% |
| 6 | 176 | 3.5% |
| 7 | 160 | 3.2% |
| 10 | 159 | 3.2% |
| 13 | 132 | 2.6% |
| 9 | 128 | 2.6% |
| Other values (190) | 2652 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1943 | |
| 2 | 1370 | |
| 3 | 990 | |
| 4 | 743 | 9.4% |
| 5 | 618 | 7.8% |
| 0 | 511 | 6.4% |
| 6 | 486 | 6.1% |
| 7 | 484 | 6.1% |
| 8 | 421 | 5.3% |
| 9 | 360 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7926 | |
| Other Punctuation | 8 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1943 | |
| 2 | 1370 | |
| 3 | 990 | |
| 4 | 743 | 9.4% |
| 5 | 618 | 7.8% |
| 0 | 511 | 6.4% |
| 6 | 486 | 6.1% |
| 7 | 484 | 6.1% |
| 8 | 421 | 5.3% |
| 9 | 360 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7934 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1943 | |
| 2 | 1370 | |
| 3 | 990 | |
| 4 | 743 | 9.4% |
| 5 | 618 | 7.8% |
| 0 | 511 | 6.4% |
| 6 | 486 | 6.1% |
| 7 | 484 | 6.1% |
| 8 | 421 | 5.3% |
| 9 | 360 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7934 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1943 | |
| 2 | 1370 | |
| 3 | 990 | |
| 4 | 743 | 9.4% |
| 5 | 618 | 7.8% |
| 0 | 511 | 6.4% |
| 6 | 486 | 6.1% |
| 7 | 484 | 6.1% |
| 8 | 421 | 5.3% |
| 9 | 360 | 4.5% |
| Distinct | 191 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.4620607 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7322 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 50 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | ? |
| Value | Count | Frequency (%) |
| 0 | 1040 | |
| 1 | 308 | 6.2% |
| 2 | 263 | 5.3% |
| 235 | 4.7% | |
| 3 | 193 | 3.9% |
| 4 | 185 | 3.7% |
| 5 | 139 | 2.8% |
| 6 | 133 | 2.7% |
| 7 | 126 | 2.5% |
| 8 | 126 | 2.5% |
| Other values (181) | 2260 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1582 | |
| 0 | 1367 | |
| 2 | 974 | |
| 3 | 669 | |
| 4 | 569 | 7.8% |
| 5 | 488 | 6.7% |
| 6 | 405 | 5.5% |
| 7 | 388 | 5.3% |
| 8 | 334 | 4.6% |
| 9 | 311 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7087 | |
| Other Punctuation | 235 | 3.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1582 | |
| 0 | 1367 | |
| 2 | 974 | |
| 3 | 669 | |
| 4 | 569 | 8.0% |
| 5 | 488 | 6.9% |
| 6 | 405 | 5.7% |
| 7 | 388 | 5.5% |
| 8 | 334 | 4.7% |
| 9 | 311 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 235 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7322 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1582 | |
| 0 | 1367 | |
| 2 | 974 | |
| 3 | 669 | |
| 4 | 569 | 7.8% |
| 5 | 488 | 6.7% |
| 6 | 405 | 5.5% |
| 7 | 388 | 5.3% |
| 8 | 334 | 4.6% |
| 9 | 311 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1582 | |
| 0 | 1367 | |
| 2 | 974 | |
| 3 | 669 | |
| 4 | 569 | 7.8% |
| 5 | 488 | 6.7% |
| 6 | 405 | 5.5% |
| 7 | 388 | 5.3% |
| 8 | 334 | 4.6% |
| 9 | 311 | 4.2% |
crew_fatalities
Categorical
HIGH CORRELATION 
| Distinct | 29 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
| 2 | |
|---|---|
| 3 | |
| 1 | |
| 4 | |
| 5 | |
| Other values (24) |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0463259 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5240 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 5 |
| 4th row | 1 |
| 5th row | ? |
Common Values
| Value | Count | Frequency (%) |
| 2 | 892 | |
| 3 | 824 | |
| 1 | 771 | |
| 4 | 591 | |
| 5 | 402 | |
| 0 | 400 | |
| 6 | 273 | 5.5% |
| ? | 235 | 4.7% |
| 7 | 171 | 3.4% |
| 8 | 130 | 2.6% |
| Other values (19) | 319 | 6.4% |
Length
| Value | Count | Frequency (%) |
| 2 | 892 | |
| 3 | 824 | |
| 1 | 771 | |
| 4 | 591 | |
| 5 | 402 | |
| 0 | 400 | |
| 6 | 273 | 5.5% |
| 235 | 4.7% | |
| 7 | 171 | 3.4% |
| 8 | 130 | 2.6% |
| Other values (19) | 319 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1026 | |
| 2 | 940 | |
| 3 | 857 | |
| 4 | 615 | |
| 0 | 471 | |
| 5 | 415 | |
| 6 | 278 | 5.3% |
| ? | 235 | 4.5% |
| 7 | 178 | 3.4% |
| 8 | 133 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5005 | |
| Other Punctuation | 235 | 4.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1026 | |
| 2 | 940 | |
| 3 | 857 | |
| 4 | 615 | |
| 0 | 471 | |
| 5 | 415 | |
| 6 | 278 | 5.6% |
| 7 | 178 | 3.6% |
| 8 | 133 | 2.7% |
| 9 | 92 | 1.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 235 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5240 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1026 | |
| 2 | 940 | |
| 3 | 857 | |
| 4 | 615 | |
| 0 | 471 | |
| 5 | 415 | |
| 6 | 278 | 5.3% |
| ? | 235 | 4.5% |
| 7 | 178 | 3.4% |
| 8 | 133 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5240 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1026 | |
| 2 | 940 | |
| 3 | 857 | |
| 4 | 615 | |
| 0 | 471 | |
| 5 | 415 | |
| 6 | 278 | 5.3% |
| ? | 235 | 4.5% |
| 7 | 178 | 3.4% |
| 8 | 133 | 2.5% |
ground
Text
| Distinct | 52 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 1 |
| Mean length | 1.0167732 |
| Min length | 1 |
Characters and Unicode
| Total characters | 5092 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 26 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 4716 | |
| 1 | 63 | 1.3% |
| 44 | 0.9% | |
| 2 | 34 | 0.7% |
| 3 | 21 | 0.4% |
| 4 | 16 | 0.3% |
| 5 | 12 | 0.2% |
| 7 | 10 | 0.2% |
| 8 | 9 | 0.2% |
| 10 | 6 | 0.1% |
| Other values (42) | 77 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4731 | |
| 1 | 104 | 2.0% |
| 2 | 62 | 1.2% |
| ? | 44 | 0.9% |
| 3 | 40 | 0.8% |
| 4 | 34 | 0.7% |
| 5 | 28 | 0.5% |
| 7 | 18 | 0.4% |
| 8 | 14 | 0.3% |
| 6 | 9 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5048 | |
| Other Punctuation | 44 | 0.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4731 | |
| 1 | 104 | 2.1% |
| 2 | 62 | 1.2% |
| 3 | 40 | 0.8% |
| 4 | 34 | 0.7% |
| 5 | 28 | 0.6% |
| 7 | 18 | 0.4% |
| 8 | 14 | 0.3% |
| 6 | 9 | 0.2% |
| 9 | 8 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 44 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5092 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4731 | |
| 1 | 104 | 2.0% |
| 2 | 62 | 1.2% |
| ? | 44 | 0.9% |
| 3 | 40 | 0.8% |
| 4 | 34 | 0.7% |
| 5 | 28 | 0.5% |
| 7 | 18 | 0.4% |
| 8 | 14 | 0.3% |
| 6 | 9 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5092 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4731 | |
| 1 | 104 | 2.0% |
| 2 | 62 | 1.2% |
| ? | 44 | 0.9% |
| 3 | 40 | 0.8% |
| 4 | 34 | 0.7% |
| 5 | 28 | 0.5% |
| 7 | 18 | 0.4% |
| 8 | 14 | 0.3% |
| 6 | 9 | 0.2% |
summary
Text
| Distinct | 4858 |
|---|---|
| Distinct (%) | 97.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.3 KiB |
Length
| Max length | 2669 |
|---|---|
| Median length | 791 |
| Mean length | 220.77376 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1105635 |
|---|---|
| Distinct characters | 102 |
| Distinct categories | 14 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 3 ? |
Unique
| Unique | 4813 ? |
|---|---|
| Unique (%) | 96.1% |
Sample
| 1st row | During a demonstration flight, a U.S. Army flyer flown by Orville Wright nose-dived into the ground from a height of approximately 75 feet, killing Lt. Thomas E. Selfridge, 26, who was a passenger. This was the first recorded airplane fatality in history. One of two propellers separated in flight, tearing loose the wires bracing the rudder and causing the loss of control of the aircraft. Orville Wright suffered broken ribs, pelvis and a leg. Selfridge suffered a crushed skull and died a short time later. |
|---|---|
| 2nd row | Eugene Lefebvre was the first pilot to ever be killed in an air accident, after his controls jambed while flying in an air show. |
| 3rd row | First U.S. dirigible Akron exploded just offshore at an altitude of 1,000 ft. during a test flight. |
| 4th row | The first fatal airplane accident in Canada occurred when American barnstormer, John M. Bryant, California aviator was killed. |
| 5th row | The airship flew into a thunderstorm and encountered a severe downdraft crashing 20 miles north of Helgoland Island into the sea. The ship broke in two and the control car immediately sank drowning its occupants. |
| Value | Count | Frequency (%) |
| the | 18463 | 10.1% |
| of | 5544 | 3.0% |
| a | 5456 | 3.0% |
| and | 5444 | 3.0% |
| to | 5429 | 3.0% |
| in | 3682 | 2.0% |
| crashed | 3386 | 1.8% |
| was | 2779 | 1.5% |
| aircraft | 2557 | 1.4% |
| into | 2360 | 1.3% |
| Other values (11568) | 128035 |
Most occurring characters
| Value | Count | Frequency (%) |
| 179362 | ||
| e | 104905 | 9.5% |
| t | 81905 | 7.4% |
| a | 79924 | 7.2% |
| n | 68116 | 6.2% |
| i | 65870 | 6.0% |
| r | 63437 | 5.7% |
| o | 62600 | 5.7% |
| h | 42794 | 3.9% |
| s | 39810 | 3.6% |
| Other values (92) | 316912 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 869373 | |
| Space Separator | 179369 | 16.2% |
| Uppercase Letter | 25294 | 2.3% |
| Other Punctuation | 20683 | 1.9% |
| Decimal Number | 8853 | 0.8% |
| Dash Punctuation | 1645 | 0.1% |
| Close Punctuation | 158 | < 0.1% |
| Open Punctuation | 140 | < 0.1% |
| Final Punctuation | 67 | < 0.1% |
| Control | 33 | < 0.1% |
| Other values (4) | 20 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 104905 | |
| t | 81905 | 9.4% |
| a | 79924 | 9.2% |
| n | 68116 | 7.8% |
| i | 65870 | 7.6% |
| r | 63437 | 7.3% |
| o | 62600 | 7.2% |
| h | 42794 | 4.9% |
| s | 39810 | 4.6% |
| d | 38411 | 4.4% |
| Other values (30) | 221601 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 5796 | |
| C | 2775 | |
| A | 2579 | |
| S | 1531 | 6.1% |
| F | 1286 | 5.1% |
| M | 1207 | 4.8% |
| I | 1063 | 4.2% |
| P | 960 | 3.8% |
| W | 924 | 3.7% |
| N | 861 | 3.4% |
| Other values (16) | 6312 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 13487 | |
| , | 5721 | |
| ' | 771 | 3.7% |
| " | 362 | 1.8% |
| / | 170 | 0.8% |
| ? | 59 | 0.3% |
| : | 56 | 0.3% |
| ; | 34 | 0.2% |
| & | 17 | 0.1% |
| % | 3 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2668 | |
| 1 | 1368 | |
| 2 | 1042 | 11.8% |
| 5 | 830 | 9.4% |
| 3 | 820 | 9.3% |
| 4 | 578 | 6.5% |
| 6 | 432 | 4.9% |
| 7 | 416 | 4.7% |
| 8 | 386 | 4.4% |
| 9 | 313 | 3.5% |
Space Separator
| Value | Count | Frequency (%) |
| 179362 | ||
| Â | 7 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 157 | |
| ] | 1 | 0.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 139 | |
| [ | 1 | 0.7% |
Control
| Value | Count | Frequency (%) |
| 32 | ||
| 1 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1645 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 67 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 7 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 7 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 894667 | |
| Common | 210968 | 19.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 104905 | |
| t | 81905 | 9.2% |
| a | 79924 | 8.9% |
| n | 68116 | 7.6% |
| i | 65870 | 7.4% |
| r | 63437 | 7.1% |
| o | 62600 | 7.0% |
| h | 42794 | 4.8% |
| s | 39810 | 4.4% |
| d | 38411 | 4.3% |
| Other values (56) | 246895 |
Common
| Value | Count | Frequency (%) |
| 179362 | ||
| . | 13487 | 6.4% |
| , | 5721 | 2.7% |
| 0 | 2668 | 1.3% |
| - | 1645 | 0.8% |
| 1 | 1368 | 0.6% |
| 2 | 1042 | 0.5% |
| 5 | 830 | 0.4% |
| 3 | 820 | 0.4% |
| ' | 771 | 0.4% |
| Other values (26) | 3254 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1105493 | |
| None | 72 | < 0.1% |
| Punctuation | 70 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 179362 | ||
| e | 104905 | 9.5% |
| t | 81905 | 7.4% |
| a | 79924 | 7.2% |
| n | 68116 | 6.2% |
| i | 65870 | 6.0% |
| r | 63437 | 5.7% |
| o | 62600 | 5.7% |
| h | 42794 | 3.9% |
| s | 39810 | 3.6% |
| Other values (74) | 316770 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 67 | |
| ‘ | 3 | 4.3% |
None
| Value | Count | Frequency (%) |
| é | 20 | |
| á | 15 | |
| Ã | 8 | 11.1% |
| Â | 7 | 9.7% |
| ó | 3 | 4.2% |
| ° | 3 | 4.2% |
| ö | 3 | 4.2% |
| ð | 2 | 2.8% |
| ü | 2 | 2.8% |
| ã | 2 | 2.8% |
| Other values (6) | 7 | 9.7% |
| crew_aboard | crew_fatalities | |
|---|---|---|
| crew_aboard | 1.000 | 0.755 |
| crew_fatalities | 0.755 | 1.000 |
| fecha | HORA declarada | Ruta | OperadOR | flight_no | route | ac_type | registration | cn_ln | all_aboard | PASAJEROS A BORDO | crew_aboard | cantidad de fallecidos | passenger_fatalities | crew_fatalities | ground | summary | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | September 17, 1908 | 1718 | Fort Myer, Virginia | Military - U.S. Army | ? | Demonstration | Wright Flyer III | ? | 1 | 2 | 1 | 1 | 1 | 1 | 0 | 0 | During a demonstration flight, a U.S. Army flyer flown by Orville Wright nose-dived into the ground from a height of approximately 75 feet, killing Lt. Thomas E. Selfridge, 26, who was a passenger. This was the first recorded airplane fatality in history. One of two propellers separated in flight, tearing loose the wires bracing the rudder and causing the loss of control of the aircraft. Orville Wright suffered broken ribs, pelvis and a leg. Selfridge suffered a crushed skull and died a short time later. |
| 1 | September 07, 1909 | ? | Juvisy-sur-Orge, France | ? | ? | Air show | Wright Byplane | SC1 | ? | 1 | 0 | 1 | 1 | 0 | 0 | 0 | Eugene Lefebvre was the first pilot to ever be killed in an air accident, after his controls jambed while flying in an air show. |
| 2 | July 12, 1912 | 0630 | Atlantic City, New Jersey | Military - U.S. Navy | ? | Test flight | Dirigible | ? | ? | 5 | 0 | 5 | 5 | 0 | 5 | 0 | First U.S. dirigible Akron exploded just offshore at an altitude of 1,000 ft. during a test flight. |
| 3 | August 06, 1913 | ? | Victoria, British Columbia, Canada | Private | ? | ? | Curtiss seaplane | ? | ? | 1 | 0 | 1 | 1 | 0 | 1 | 0 | The first fatal airplane accident in Canada occurred when American barnstormer, John M. Bryant, California aviator was killed. |
| 4 | September 09, 1913 | 1830 | Over the North Sea | Military - German Navy | ? | ? | Zeppelin L-1 (airship) | ? | ? | 20 | ? | ? | 14 | ? | ? | 0 | The airship flew into a thunderstorm and encountered a severe downdraft crashing 20 miles north of Helgoland Island into the sea. The ship broke in two and the control car immediately sank drowning its occupants. |
| 5 | October 17, 1913 | 1030 | Near Johannisthal, Germany | Military - German Navy | ? | ? | Zeppelin L-2 (airship) | ? | ? | 28 | ? | ? | 28 | ? | ? | 0 | Hydrogen gas which was being vented was sucked into the forward engine and ignited causing the airship to explode and burn at 3,000 ft..German Navy's Zeppelin airships L-4 and L-5 were blown out to sea in February 1915, never to be seen again. |
| 6 | March 05, 1915 | 0100 | Tienen, Belgium | Military - German Navy | ? | ? | Zeppelin L-8 (airship) | ? | ? | 41 | 0 | 41 | 17 | 0 | 17 | 0 | Crashed into trees while attempting to land after being shot down by British and French aircraft. |
| 7 | September 03, 1915 | 1520 | Off Cuxhaven, Germany | Military - German Navy | ? | ? | Zeppelin L-10 (airship) | ? | ? | 19 | ? | ? | 19 | ? | ? | 0 | Exploded and burned near Neuwerk Island, when hydrogen gas, being vented, was ignited by lightning. |
| 8 | July 28, 1916 | ? | Near Jambol, Bulgeria | Military - German Army | ? | ? | Schutte-Lanz S-L-10 (airship) | ? | ? | 20 | ? | ? | 20 | ? | ? | 0 | Crashed near the Black Sea, cause unknown. |
| 9 | September 24, 1916 | 0100 | Billericay, England | Military - German Navy | ? | ? | Zeppelin L-32 (airship) | ? | ? | 22 | ? | ? | 22 | ? | ? | 0 | Shot down by British aircraft crashing in flames. |
| fecha | HORA declarada | Ruta | OperadOR | flight_no | route | ac_type | registration | cn_ln | all_aboard | PASAJEROS A BORDO | crew_aboard | cantidad de fallecidos | passenger_fatalities | crew_fatalities | ground | summary | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4998 | August 07, 2020 | 1914 | Calicut, India | Air India Exppress | IX344 | Dubai - Calicut | Boeing 737-8HG | VT-AXH | 36323/2108 | 190 | 184 | 6 | 20 | 18 | 2 | 0 | The flight IX344 suffered a runway excursion while landing at Kozhikode-Calicut Airport in heavy rain. The nose section separated from the fuselage after going down a steep slope at the end of the runway. The pilot and copilot were among the dead. Low visibility, wet runway, low cloud base and poor braking action possibly contributed to the accident. |
| 4999 | August 22, 2020 | 0840 | Juba, South Sudan | South West Aviaiton | ? | Juba - Wau | Antonov 26B | EX-126 | 11508 | 8 | 5 | 3 | 7 | 4 | 3 | 0 | The cargo plane lost height shortly after departure from Juba Airport and impacted a farm near Hai Referendum about 3nm southwest of the airport. One passenger survived in critical condition. The plane was chartered by the World Food Program to transport supplies and wages to Wau and Aweil. |
| 5000 | September 25, 2020 | 2050 | Near Chuguev, Ukraine | Military - Ukraine Air Force | ? | Training | Antonov An26SH | 76 yellow | 5608 | 27 | 20 | 7 | 26 | 19 | 7 | 0 | The military transport, crashed 1.2 miles from Chuguev air base. The plane was carrying cadets from a nearby air force university on a training flight. The crew may have reported failure of an engine prior to the accident. |
| 5001 | January 09, 2021 | 1440 | Near Jakarta, Indonesia | Sriwijaya Air | SJ182 | Jakarta - Pontianak | Boeing 737-524 | PK-CLC | 27323/2616 | 62 | 56 | 6 | 62 | 56 | 6 | 0 | Sriwijaya Air flight 182 was climbing through 10,900 ft., 11 nm north of Jakarta-Soekarno-Hatta International Airport, over the Java Sea when radar and radio contact was lost. The aircraft then lost height rapidly and impacted the Java Sea. Debris was located near Lancang Island. |
| 5002 | March 02, 2021 | 1705 | Pieri, Sudan | South Sudan Supreme Airlines | ? | Pieri - Yuai | Let L-410UVP-E | HK-4274 | 902525 | 10 | 8 | 2 | 10 | 8 | 2 | 0 | One of the engines on the aircraft failed 10 minutes after takeof. When the plane turned back, the second engine failed. |
| 5003 | March 28, 2021 | 1835 | Near Butte, Alaska | Soloy Helicopters | ? | Sightseeing Charter | Eurocopter AS350B3Â Ecureuil | N351SH | 4598 | 6 | 5 | 1 | 5 | 4 | 1 | 0 | The sightseeing helicopter crashed after missing the top of a 6,000 ft mountain by just 10 - 15 ft. The crash site was near Knik glacier. The pilot, and four others were killed including Czech billionaire Petr Kellner. |
| 5004 | May 21, 2021 | 1800 | Near Kaduna, Nigeria | Military - Nigerian Air Force | ? | ? | Beechcraft B300 King Air 350i | NAF203 | FL-891 | 11 | 7 | 4 | 11 | 7 | 4 | 0 | While on final approach, in poor weather conditions, the aircraft crashed and burst into flames less than 10 km from Kaduna Airport. All 11 occupants were killed, incuding General Ibrahim Attahiru, Chief of Staff of the Nigerian Army. |
| 5005 | June 10, 2021 | 0800 | Near Pyin Oo Lwin, Myanmar | Military - Myanmar Air Force | ? | Naypyidaw - Anisakan | Beechcraft 1900D | 4610 | E-325 | 14 | 12 | 2 | 12 | 11 | 1 | 0 | The plane was carrying military personnel and monks when it crashed about 300 meters from a steel plant in the Mandalay region. The plane was attempting to land in poor weather conditions and broke into three pieces. |
| 5006 | July 04, 2021 | 11:30 | Patikul, Sulu, Philippines | Military - Philippine Air Force | ? | Cagayan de Oro-Lumbia - Jolo | Lockheed C-130H Hercules | 5125 | 5125 | 96 | 88 | 8 | 50 | ? | ? | 3 | While attempting to land at Jolo Airport, the military transport overran the runway, struck two houses and burst into flames coming to rest on a coconut plantation. |
| 5007 | July 06, 2021 | 1500 | Palana, Russia | Kamchatka Aviation Enterprise | 251 | Petropavlovsk - Palana | Antonov An 26B-100 | RA-26085 | 12310 | 28 | 22 | 6 | 28 | 22 | 6 | 0 | The passenger plane crashed into the top of a cliff while attempting to land in inclement weather. The debris fell into the sea. Contact was lost with the plane 10 minutes before it was to land. |